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Summary 


This  report  presents  the  resuiis  of  research  towards  an  ontology  capture  tocthod  referred  to 
as  IDEF5.  Viewed  simply  as  the  study  of  what  there  is  in  a  domain,  ontology  is  at  work 
across  the  full  range  of  human  inquiry  prompted  by  the  persisicni  effort  to  understand  the 
world  in  which  it  has  found  itsclf--and  which  it  has  helped  to  shape.  In  the  context  of 
information  management,  ontology  is  the  task  of  extracting  the  structure  of  a  given 
engineering,  manufacturing,  business,  or  logistical  domain  and  storing  it  in  a  usable 
representational  medium.  A  key  to  effective  integration  is  a  system  ontology  that  can  be 
accessed  and  modified  across  domains  and  which  captures  common  tcatures  of  the  overall 
system  relevant  to  the  goals  of  the  disparate  domains.  If  the  focus  is  on  information 
integration,  then  the  strongest  motivation  for  ontology  comes  from  the  need  to  support  data 
sharing  and  function  interoperability.  In  the  correct  architecture,  an  enterprise  ontology 
base  would  allow  the  construction  of  an  integrated  environment  in  which  legacy  systems 
appear  to  be  open-architecture,  integrated  resources.  If  the  focus  is  on  systcm/sofiware 
development,  then  support  for  the  rapid  acquisition  of  reliable  systems  is  perhaps  the 
strongest  motivation  for  ontology.  Finally,  ontological  analysis  has  been  demonstrated  to 
be  an  effective  first  step  in  the  construction  of  robust  knowledge-based  systems. 

An  IDEF5  description  of  an  ontology  is  a  computationally  tractable  representation  of  what 
exists  in  a  given  domain.  IDEF5  provides  the  means  to  identify  the  primary  classes,  or 
kinds,  of  objects  within  the  domain  by  isolating  the  properties  that  define  the  members  of 
those  kinds  and  the  characteristic  relations  that  hold  between  domain  objects.  IDEF5 
allows  such  representations  to  be  purposely  structured  in  a  way  that  closely  reflects  human 
conceptualization  of  the  domains  in  question.  In  IDEF5.  differing  perspectives  on  the  same 
domain  (e.g.,  varying  levels  of  granularity)  and  their  interrelations  are  also  supported. 
Finally,  IDEF5  supports  the  identification  of  complex  kinds  {system  kinds)  and  the 
properties  and  relations  that  characterize  members  of  those  kinds. 


Part  I:  Background,  Motivation,  and  Informal  Foundations 

Any  organized  system--a  business,  university,  manufacturing  plant  -can  be  considered  the 
resultant  of  three  vectors. 

(1)  the  system  ontology,  i.e.,  the  basic  entities  that  populate  the  system- -personnel, 
equipment,  manufacturing  systems,  etc.; 

(2)  the  structure  tho.se  entities  jointly  exhibit--the  relations  they  bear  to  one  another, 
and 

(3)  the  processes  they  undergo--thc  changes  that  take  place  in  the  organization  over 
tinK. 

An  accurate  representation  of  such  a  system  will  thus  reflect  the  information  within  all  three 
vectors.  Currently,  existing  IDEF’  methods  arc  geared  chiefly  toward  information  of  the 
second  and  third  types;  IDEFl  (Information  Modeling  Method)  and  IDEFIX  (Semantic 
Data  Modeling  Method)  capture  primarily  structural  information;  IDEFO  (Function 
Modeling  Method)  and  IDEF3  (Process  Description  Capture  Method)  capture  various  i>'pc.s 
of  process  information.  Of  course,  since  both  structural  information  and  process 
information  involve  objects  in  a  system,  there  is  the  capacity  for  limited  ontology 
representation  within  the  existing  methods.  However,  as  noted  below,  several  imponant 
kinds  of  ontological  information  are  not  representable  in  the  languages  associated  with 
these  methods.  Furthermore,  these  methods  do  not  include  techniques  specifically 
designed  for  eliciting  and  capturing  system  ontologies.  This  suggests  that  there  is  a  need 
for  a  separate  method.  We  intend  to  substantiate  this  suggestion  and  begin  laying  the 
groundwork  for  the  needed  method,  IDEF5. 

Like  other  IDEF  methods,  IDEF5  should  be  accompanied  by:  (1)  a  rigorous,  formal 
foundation  for  the  method  and  (2)  an  accompanying  software  implementation  designed  for 
practical  information  capture  and  information  modeling.  The  software  tool  would  be 


^  The  name  IDEF  originates  from  the  Air  Force  program  for  Integrated  Computer-Aided  Manufacturing 
(ICAM)  from  which  the  first  ICAM  Definition,  or  IDEF,  mcihod.s  emerged.  It  was  in  recognition  of  this 
foundational  work,  and  in  support  of  an  overall  .strategy  to  provide  a  family  of  muluaJly-supportivc  methods 
for  enterprise  integration,  that  continued  development  of  IDEF  technology  was  undertaken. 
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designed  for  use  by  domain  expens- -people  attuned  to  the  wav  a  specific  system  works. 
The  basic  question  faced  by  any  domain  expen,  or  bv  a  knowledge  engineer  working  with 
such  an  expert,  is  how  to  describe  the  things  he  or  she  knows  about.  A  good  ntcihcxJ  will 
reveal  the  appropriate  sons  of  general  structures  that  classify  the  knowledge  being  sought 
smoothly  and  flexibly-the  formal  foundation--and  provide  a  rich,  powerful,  user-friendly 
environment  for  eliciting  that  information  from  the  expen,  then  store  and  integrate  the 
garnered  information  efficiently  and  effectively.  In  the  following  sections,  we  desenbe  the 
nature  of  ontology  and  ontological  information,  sketch  the  proposed  IDEF5  formal  and 
methodological  foundations  for  capturing  that  information,  and  discuss  the  general 
proposed  features  of  an  IDEF5  software  environment. 

Philosophical  Foundations:  The  Nature  of  Ontology 

In  Western  thought,  ontology  has  chiefly  been  considered  an  attempt  to  divide  the  world  at 
its  joints.  In  a  word,  it  can  be  called  the  study  of  what  there  is.  Historically,  ontology 
arose  as  the  major  component  of  the  branch  of  philosophy  known  as  metaphysics,  which 
deals  with  the  nature  of  reality  in  general.  Metaphysics  is  perhaps  most  often  associated 
with  questions  typically  taken  to  be  beyond  the  reach  of  physical  science,  such  as  the  nature 
of  the  soul  or  the  mind,  the  existence  of  God,  or  whether  or  not  we  have  free  will. 2 
However,  there  is  no  necessary  connection  between  ontology  and  pure,  nonempirical, 
philosophical  speculation.  Viewed  simply  as  the  study  of  what  there  is.  ontology  is  an 
activity  at  work  across  the  full  range  of  human  inquiry  prompted  by  humanity's  persistent 
effort  to  understand  the  world  in  which  it  exisLs-and  which  it  has  helped  to  shape. 

Natural  science,  in  particular,  can  be  viewed  as  an  example  of  ontology  par  excellence. 
Perhaps  the  chief  goal  of  subatomic  physics,  for  example,  is  to  develop  a  taxonomy  of  the 
most  basic  kinds  of  objects  that  exist  within  the  natural  world— electrons,  protons,  muons, 
and  their  fellows.  At  the  other  end  of  the  spectrum,  astrophysics,  among  other  things, 
seeks  to  discover  the  range  of  objects  that  exist  in  its  domain:  quasars,  black  holes,  gravity 
waves,  etc.  Similarly,  the  so-called  life  .sciences  seek  to  categorize  and  describe  the  various 
kinds  of  living  organisms  that  populate  the  planet.  Such  examples  can  be  multiplied,  of 
course,  from  geology  to  psychology,  chemistry  to  sociolinguistics. 


2  Unfortunately,  in  contrast  to  these  deep,  important-albcit  often  unrc.solvablc--quc.siion.s,  in  the  popular 
consciousness,  the  term  "metaphysics"  has  also  come  to  be  associated  with  such  pscudo-intcUcctual  bilge  as 
astrology,  astral  projection,  occult  “science,”  and  similar  noascnsc. 
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This  sort  of  inquiry  is  not  limited  to  the  natural  sciences,  however.  The  absu^act  sciences  as 
weil-mathemaiics,  in  particular-are  at  least  in  pan  an  attempt  to  discover  and  categcrize  the 
domain  of  abstract  objects;  prime  numbers,  transfinitc  ordinals,  Hilben  spaces,  continuous 
nondifferentiable  functions,  polynomial  algorithms,  commutative  groups,  and  so  on. 

The  natural  and  abstract  worlds,  however,  do  not  exhaust  the  applicable  domains  of 
ontology.  There  are  vast,  human-designed-and-engineered  systems— manufacturing  plants, 
businesses,  military  bases,  etc,— in  which  the  task  is  just  as  relevant  and  just  as  pressing. 
Here,  though,  the  ontological  enterprise  is  motivated  not  so  much  by  the  search  for 
knowledge  for  its  own  sake  (as,  ideally,  it  is  in  the  natural  and  abstract  sciences)  but  by  the 
need  to  understand,  design,  engineer,  and  manage  such  systems  effectively. 

Ontology,  then,  is  a  basic  research  task  common  to  the  natural  and  abstract  sciences  on  the 
one  hand,  and  common  to  the  information  sciences  on  the  other.  In  the  next  section,  wc 
present  the  nature  of  ontological  information  in  greater  detail  and  discuss  its  application  to 
the  information  sciences. 

Kinds  and  Instances 

Ontology  can  be  understood  to  involve  several  subtasks;  four  are  especially  worth 
discussing  here:  (i)  providing  an  inventory  of  the  kinds  of  objects  that  exist  within  a  given 
domain  according  to  our  best  sources  of  information  regarding  that  domain  (c.g.,  a  theory 
or  a  domain  expen);  (ii)  for  each  kind  of  object,  providing  a  description  of  the  properties 
common  to  all  and  only  instances  of  that  kind;  (iii)  characterizing  the  particular  objects  that 
in  fact  instantiate  the  kinds  within  a  system;  and  (iv)  providing  an  inventory  of  the 
associations  that  exist  within  a  given  domain  between  (and  within)  kinds  of  objects. 

The  first  two  tasks  are  common  in  the  physical  sciences.  Thus,  in  microphysics,  for 
example,  one  finds  the  subatomic  world  grouped  into  basic  kinds— at  the  grossest  level  (in 
the  context  of  subatomic  physics!),  leptons  and  quarks,  and  beneath  them  the  large  variety 
of  subkinds  of  each  of  those  overarching  kinds.  Along  with  each  kind,  are  the  properties 
common  to  all  and  only  members  of  the  kind,  including  the  specific  property  values  of  such 
attributes  as  mass,  charge,  spin,  and  so  on  which  all  the  members  share.  Again,  in  biology, 
one  finds  perhaps  the  foremost  example  of  classifications  into  kinds  and  subkinds  and 
charaaerizations  of  the  distinctive  properties  associated  with  each  kind. 
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The  third  task  of  ontology  becomes  more  relevant  in  contexts  where  we  want  to  be  able  to 
characterize  specific  individual  objects  and  speak  specifically  of  them  and  their  propenies. 
A  basic  metaphysical  distinction  is  especially  useful  in  this  regard,  namely,  the  distinction 
between  essential  and  accidental  propenies.  An  essential  propeny  of  an  object,  ,S,  is  a 
property  that  S  must  have  to  exist.  An  accidental  property  of  S,  by  contrast,  is  a  propeny 
that  S  has  but  does  not  require.  For  example,  the  number  17  has  the  property  of  being 
prime  essentially;  it  could  not  possibly  have  been  evenly  divisible  by  anything  other  than  1 
and  itself.  On  the  other  hand,  it  has  the  property  of  being  Dr  .Menzel’s  favorite  natural 
number  accidentally;  if  Dr  Menzel  hadn’t  existed,  or  if  his  affections  had  been  directed 
toward  the  number  43  instead,  it  would  have  lacked  this  property  (and  no  doubt  would 
have  been  none  the  worse  for  it).  Again,  human  beings  are  usually  thought  to  have  the 
property  of  being  human  essentially-no  one  could  have  been  a  donkey  or  a  stone  instead  of 
a  human.  On  the  other  hand,  all  of  us  could  have  been  (and  indeed,  have  been)  a  different 
height;  thus  one's  height  is  an  accidental  property. 

The  usual  notion  of  a  kind  is  a  class  of  objects  which  all  share  a  common  nature  (i.e.,  a  set 
of  properties  that  belong  essentially  to  all  and  only  members  of  the  kind).  Consequently, 
the  properties  that  make  a  thing  a  member  of  a  kind  are  also  those  that  define  its  nature  as 
an  entity.  This  definition  is,  for  the  most  part,  quite  appropriate  in  the  context  of  natural 
science  and  mathematics.  For  example,  the  most  natural  properties  for  delimiting  biological 
kinds  involve  having  a  certain  DNA  structure;  clearly,  this  will  also  be  an  essential  property 
of  the  animals  in  question  (on  the  reasonable  assumption  that  no  particular  animal  could 
have  been  a  member  of  a  different  species).  Similarly,  the  most  natural  properties  for 
delimiting  kinds  of  subatomic  particles  (e.g.,  a  certain  mass,  charge,  spin,  etc.)  will  be  in 
terms  of  analogous  underlying  structural  properties  that  are  essential  to  the  instances  of 
those  kinds. 

As  we  will  argue,  though,  this  definition  is  too  restrictive  for  use  in  the  context  of  human- 
designed  systems.  However,  there  is  a  closely  related  conception-alluded  to  briefly  in  the 
first  paragraph  of  this  section-that  is  somewhat  more  flexible  and  more  applicable  in  the 
context  of  information  modeling.  In  this  conception,  the  properties  that  define  a  kind  are 
not  necessarily  essential  properties  of  the  members  of  the  kind.  Rather,  the  membership 
conditions  only  specify  what  properties  it  takes  to  be  an  instance  of  that  kind,  irrespective 
of  whether  or  not  those  properties  are  essential  to  the  members.  Thus,  in  this  broader 
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conception,  a  kind  K  is  a  class  of  objects  consisting  of  all  and  only  those  things  that  exhibit 
a  cenain  set  of  properties,  which  we  can  call  the  defining  properties  of  K. 

An  example  will  help  to  show  how  this  conception  of  a  kind  is  the  more  useful  one  in  the 
context  of  human-designed  systems,  and  will  also  help  to  clarify  one  way  in  which  an 
ontology  might  function  in  the  course  of  information  management.  Consider  the  following 
representation  of  the  basic  ontology  of  a  manufacturing  cell  composed  of  five  entities. 
Objects  enter  the  cell  and  encounter  a  cutter,  then  a  drill,  an  inspection  station,  and  two 
cleaners. 


KINDS. 

DEFINING  PROPERTIES 

A;  Cutter 

{Has  diamond-cutting  tool,...} 

B:  Drill 

{Has  high  speed  motor,...) 

C:  Inspector 

{Has  high  intensity  lens  light...} 

D:  Cleaner  and  Painter 

{Has  dust  filters,  high  gloss  paint,...} 

E:  Cleaner 

{Has  liquid  cleaners,...} 

This  example  shows  the  kinds  of  objects  that  populate  the  system,  and  lists  the  defining 
properties  of  each  kind;  a  representative  defining  property  (or  two)  is  listed  for  each.  Thus, 
this  example  provides  an  abstract  representation  of  the  general  structure  that  the 
manufacturing  cell  must  exhibit  at  any  given  time.  The  property  has  a  diamond-cutting 
tool  is  a  defining  property  of  the  kind  Cutter.  However,  suppose  the  cutter  has  the 
capacity  to  switch  from  diamond-cutting  tools  to  carbide.  Then,  even  though  having  a 
diamond-cutting  tool  is  a  defining  property  of  the  kind  Cutter,  it  is  nonetheless  an 
accidental  property  of  the  cutter;  it  would  lack  the  property  if  someone  were  to  swap  the 
diamond-cutting  tool  for  a  carbide  tool.^  Thus,  its  role  as  a  defining  property  of  the  kind 


^  There  are,  of  course,  some  significant  philosophical  issues  involved  in  the  the  nature  of  artifacts;  some 
philosophers,  for  example,  argue  for  the  view-known  as  mereological  essentialism-xhat  every  part  of  an 
artifact,  or  physical  object  generally,  is  essential  to  it,  so  that  if  we  swap  one  cutting  tool  for  another  in  a 
cutter,  the  cutter  with  the  replaced  tool  ceases  to  be,  and  a  new  cutter  comes  to  exist.  The  puzzle  here  goes 
back  to  Greek  limes  iu  the  guise  of  the  Ship  of  Theseus;  if  we  bit  by  bit  replace  the  planks  of  a  ship  with 
new  planks,  and  simultaneously  build  a  new  ship,  bit  by  bit,  from  the  old  planks,  which  ship  is  which?  Is 
the  new  ship  idratical  to  the  original  ship  becau.se  it  has  the  same  parts?  Or  is  the  rebuilt  ship  identical  to 
the  original  ship  because  of  the  insignificance  of  each  plank  individually  to  the  identity  of  the  whole? 
Thankfully,  we  needn’t  address,  or  at  least  answer,  such  questions.  The  chief  purpose  of  ontology 
modeling,  and  information  modeling  generally,  is  not  so  much  to  divide  the  world  at  its  ontological  joints 
to  discern  its  ultimate  nature,  but  rather  simply  to  categorize  it  in  the  most  useful  way  for  the  purposes  at 
hand.  And  the  fact  is  that,  in  our  ordinary  ways  of  thinking  about  such  matters,  ordinary  objects  do  not 
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means  only  that  at  any  given  time,  the  cutter  in  the  manufacturing  cel!  must  have  a 
diamond-cutting  tool,  irrespective  of  whether  it  has  a  diamond-cutting  tool  essentially  or 
accidentally. 

The  general  point  here  is  that  things  can  belong  contingently  to  important  kinds  of  objects 
within  human-designed  systems.  The  reason  for  this  is  that  the  kinds  within  such  a  system 
are  usually  artifacts  (human  constructions)  and  hence  an  object  of  one  kind  might  “mutate” 
into  an  object  of  another  kind  simply  by  undergoing  some  nondestructive  change  (e.g.,  the 
exchange  of  cutting  tools).  Compare  this  with  an  electron  which  decays  into  two  pions  (a 
destructive  change  more  typical  in  natural  systems);  the  original  object  does  not  survive  but 
is  replaced  by  two  distinct  objects  of  a  different  kind. 

In  other  words,  we  use  the  broader  notion  of  a  kind  because  when  we  build  an  ontology 
for  a  certain  human-designed  system,  we  are  not  necessarily  attempting  to  discover  and 
classify  the  world  as  it  is  in  itself,  but  rather  trying  to  divide  and  categorize  the  objects 
within  the  system  in  useful  and  informative  ways.  An  ontology  categorization  scheme  is 
justified  only  insofar  as  it  is  useful  to  organizing,  managing,  and  representing  information 
in  the  system  so  categorized.  If  objects  of  a  certain  kind,  K,  play  a  useful  role  in  the 
system,  that  is  sufficient  justification  to  admit  them  into  the  system  ontology,  irrespective 
of  whether  the  defining  properties  of  K  are  essential  to  its  members. 

The  third  subtask  of  ontology  is  operative  in  the  above  example  as  well.  In  addition  to 
listing  and  characterizing  the  kinds  that  define  the  manufacturing  cell,  we  have  also 
discussed  the  natures  of  some  of  their  possible  instances  (e.g.,  whether  they  have  a  certain 
property  essentially  and  whether  that  property  is  a  part  of  their  nature).  This  is  no  mere 
philosophical  exercise.  It  might  well  be  crucial  in  distinguishing  the  essential  from 
accidental  properties.  The  essential  propenies  of  a  thing,  S,  put  inviolable  hounds  on  what 
is  possible  within  a  system  containing  S.  If  S  has  the  property  P  essentially,  it  cannot  fail 
to  have  it.  For  example,  a  design  that  specifies  a  kind  that  includes  a  propeny  that 
precludes  P  among  its  defining  properties  cannot  use  S  as  one  of  its  instances,  regardless 
of  how  well  it  might  meet  the  remaining  specifications. 


cease  to  exist  if  we  change  relatively  insignificant  parts.  As  a  matter  of  fact,  our  theory  will  remain  neutral 
on  this  question  and  will  permit,  though  not  require,  mercological  esscntialism  should  it  prove  useful  in 
some  contexts,  as  it  conceivably  might. 
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'ITicr.*  IS  more  to  ch^iractcn/ing  she  i»bjects  in  a  ^>Aietn  than  h'.ung  sJicit  pfopenic'*  In  itic 
context  of  a  given  system,  u  is  ciiuaUy  im}.K)rtitiu  to  dcuii  the  frUHonji  that  ubjcvis  m  the 
system  cun.  and  do.  bear  to  one  anothci  Convidefaiion'.  >>Uvh  as  shttse  ab<»sc  lead  us  u» 
distinguish  system  esseniiul  from  system  act tdenuil  fclaiiuns  A  s\ stem  esscfUial  fclatum 
relative  to  two  (or  rmare)  kinds  (K  |  and  Ks)  is  a  rrlatum  that  must  fKild  whenever  there  are 
instances  of  Kj  and  Ks  A  system  acctdeniai  rcianon  relative  to  K|  and  K^,  b>  comras!, 
IS  one  that  need  not  hold  between  all  (xissiblc  instances  t»f  those  kmds  !  or  example,  the 
nature  of  the  manufacturing  cell  depicted  above  might  require  a  certain  sort  of  mfomumonai 
link  to  be  established  between  the  cutter  and  the  drill  which  mfumis  the  dnli  of  the  syjse  of 
operation  the  cutter  has  perfonned  tm  a  given  piece  of  material  In  ontological  terms,  dm 
would  then  be  characten/ed  as  a  system  essential  relation  relative  to  the  kinds  Culfer  arwl 
Drill.  On  the  other  hand,  the  spatial  relationship  fsciwccn  sutler  and  dnll  mas  well  Ik 
irrelevant  (c.g  .  the  dnll  might  just  as  well  be  three  Icci  nonh  i>f  its  actual  location  m  the 
cell),  in  this  case,  wc  say  that  the  de  faiU'  spatial  relationship  between  suttcr  and  dnii  ts 
sy  stem  -  acc  iden  tal  ■* 

An  interesting  example  of  a  system  essential  retailor,  is  the  jKirt-of  relation  that  often  holds 
between  a  complex  object  and  some  of  its  parts  Consider  an  engine  of  a  sfiecific  design 
The  engine  can  itself  be  viewed  as  a  complex  system,  made  up  of  many  smaller  parts 
Each  of  these  pans  can  be  classified  as  instances  of  a  kmd.  as  can  the  engine  itself  CSivcn 
some  kind  of  part.  P.  that  is  necessary  to  the  design  of  the  engine.  K.  then,  relative  to  P 
and  E.  the  part-of  relation  is  system  essential  However,  given  an  instance  e  of  E  and  the 
in.stance  p  of  P  within  e.  some  other  instance  p*  of  P  would  have  done  just  as  well 
Hence,  the  part -of  relation  does  not  hold  essentially  between  the  instances  p  and  e. 

As  this  example  shows,  entire  systems  can  themselves  be  considered  funher  objects  in  yet 
larger  systems  and  can  be  characterized  as  possessing  certain  properties  (e  g  .  a 
manufacturing  cell  which  comprises  five  machines)  This  means  that  an  adequate  ontology 
tool  will  have  the  capability  to  examine  and  characterize  the  system  from  the  coarsest  to 
finest  levels  of  detail. 


‘‘Certain  facts  about  the  configuration  of  ihc  drill  or  cutter  couid.  however,  req  ire  that  ihc  two  be  ooented 
in  one  and  only  one  way.  In  this  C3.se.  the  relation  would  be  system -essential  Note  that,  just  as  defining 
properties  of  kinds  ncedn  i  be  essential  to  ihcir  instances,  in  the  same  way  entities  ihal  stand  in  system 
essential  relatkms  don’t  necessarily  stand  in  those  relations  cxscntiaHy;  though  being  spauaJiy  wicntcd  in  a 
catain  way  might  be  essenuaJ  widnn  the  system,  the  dnll  and  the  cuucr  don't  neccssanly  have  to  stand  in 
that  relation  in  any  possible  system  in  which  they  might  exist. 
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Accumulation  of  Domain  ()atolo|*ics 


Whai,  exactly,  ontology  gCHxl  tor  ’  VVt’ui  role  can  it  play  in  the  dcMgn  and  dcvciopnicni 
of  information  sysicms  '  In  what  sons  of  uitonnation  modeling  contexts  vcsii  u  be  useful  * 

One  of  the  most  important  aspects  of  the  general  development  and  use  of  the  IDHF5  method 
will  be  the  accumulation  of  a  wide  range  of  domain  ontologies  Cjcncrally.  one  of  the 
greatest  problems  in  information  management  is  inefficiency.  Redundant  effon  is 
expended  to  capture  or  re-create  infonnation  that  has  already  been  recorded  elsewhere. 
Consider  the  analogy  with  programming.  Very'  often  the  same  kinds  of  routines  (e  g  ,  in 
the  design  of  user  interfaces)  are  used  repeatedly  in  different  programs,  typically  by 
different  programmers.  Enomtous  amounts  of  time  and  effort  have  thus  gone  into 
reinventing  the  wheel  over  and  over  again.  Recognition  of  this  problem  has  led  to  the 
development  of  vast  libraries  that  have  been  collected  over  time  which  contain  oft-used 
routines  which  a  programmer  can  simply  call  into  his  or  her  program,  rather  than  having  to 
duplicate  (he  function  of  existing  code. 

Information  management  across  similar  settings  faces  the  same  sort  of  problem. 
Manufacturing  domains,  for  example,  share  many  common  features;  the  more  similar  the 
domains,  the  more  features  they  share  Rather  than  encoding  this  infonoation  again  for 
each  new  setting,  we  propose  to  develop  an  analogue  of  the  concept  of  a  programming 
library  by  collecting  this  common  information  into  ontology  libraries:  that  is,  large, 
revisable  databases  of  structured,  domain-specific  ontological  infomtaiion  which  can  be 
used  in  the  IDEF5  environment.  We  envision  numerous  advantages  to  such  libraries,  two 
of  which  are  especially  notable.  First,  domain  experts  developing  an  IDEF5  ontology  for  a 
specific  system  will  be  able  to  import  relevant  portions  of  the  general  ontology  database  for 
the  type  of  system  they  are  describing  directly  into  their  IDEF5s.  This  will  save  them  the 
trouble  of  having  to  record  the  information  directly,  thus  providing  the  analogue  to  the 
concept  of  a  programming  library.  This  information  will,  of  course,  be  malleable  so  that  a 
given  expert  can  modify  it  in  response  to  unique  features  of  his  or  her  system.  Second,  the 
information  can  be  used  to  construct  general  techniques  for  aiding  the  domain  expert  in 
extracting  domain  knowledge.  For  example,  by  isolating  and  analyzing  general  patterns  or 
features  of  ontologies  within  certain  domains,  one  can  develop  productive  strategies  for 
eliciting  and  structuring  the  sorts  of  knowledge  one  is  likely  to  find  in  those  domains.  If  a 
particular  common  type  of  machine  varies  in  cenain  details  from  location  to  location,  the 
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background  ontology  database  can  impon  the  common  information  directly,  and  lead  the 
user  through  a  series  of  questions  to  elicit  the  specifications  unique  to  his/her  domain 
Again,  an  expen  may  not  know  how  a  certain  object  should  be  classified.  By  searching  a 
list  of  essential  properties  of  the  object,  the  tool  could  return  a  set  of  kinds  in  which  the 
object  would  most  naturally  be  included. 

With  an  array  of  ontology  databases  in  use  across  a  wide  variety  of  engineering, 
manufacturing,  business,  and  logistical  systems,  the  task  of  information  modeling  could  be 
revolutionized.  Of  course,  the  construction  of  such  databases  is  an  enormous-  though  we 
believe  quite  realizable— task.  However,  there  is  an  even  more  basic  task.  Before  one  can 
build  any  complex  physical  objects  (e.g.,  a  bridge)  there  must  be  an  appropriate 
methodological  and  theoretical  foundation.  This  is  no  less  true  for  abstract  objects  like 
information  models.  That  is,  before  we  can  think  about  the  structure  of  a  domain-specific 
ontology  database,  we  need  forma!  theoretical  foundations  for  ontology  proper  (e.g.,  the 
appropriate  representational  medium)  and  methodological  foundations  for  the  capture  and 
storage  of  ontological  information. 

Ontology  and  Existing  Methods 

The  goal  of  IDEF5  is  not  to  define  yet  another  method  to  do  something  a  little  better  or  a 
little  different  than  'ome  other  existing  method.  We  have  no  interest  in  and  see  no  virtue  to 
instigating  another  skirmish  in  the  methodology  wars.  Rather,  our  first  goal  is  to  point  out 
a  gap  in  the  existing  set  of  methods:  there  is,  we  believe,  a  type  of  information-ontological 
information— that  has  not  been  directly  targeted  by  any  existing  method.  Our  second  goal  is 
to  make  some  preliminary  suggestions  for  filling  that  gap,  both  theoretically  and  practically. 

Thus  far,  we  have  outlined  the  nature  of  ontological  information.  The  importance  of  this 
sort  of  information  should  be  clear.  What  is  perhaps  less  clear  is  the  need  for  a  new 
method  to  capture  this  information.  That  issue  will  be  addressed  in  this  section. 

For  those  familiar  with  other  IDEF  methods,  the  idea  of  capturing  information  about  kinds 
and  their  associated  properties  will  no  doubt  suggest  both  IDEFl  and  IDEFIX.  A  kind  has 
been  defined  above  as  a  certain  sort  of  class;  this  might  suggest  that  a  kind  is  like  an  IDEFl 
entity  class  or  an  IDEFIX  entity.  Furthermore,  associated  with  each  entity  class  (entity)  is 
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a  list  of  associated  attributes  that  assign  propeny  values  to  the  members  of  the  entity  class. 
Perhaps  the  makings  of  an  ontology  modeling  method  are  in  one  of  these  two  methods. 

We  will  begin  by  addressing  IDEFl.  It  would  be  a  serious  error  to  think  of  IDEM  as  an 
ontology  modeling  tool  because  ontology  modeling  is  real-world  modeling.  Thai  is,  the 
members  of  kinds  are  real-world  objects,  the  actual  instances  of  those  kinds  that  exist 
within  the  system  being  modeled.  The  members  of  an  IDEFl  entity  class,  by  contrast,  are 
information  objects-they  are  objectified  clusters  of  information  about  a  system  that  must  be 
kept,  the  various  “information  images"  of  the  real-world  objects  within  a  system.  Such 
objects  are  defined  by  the  information  they  encode.  Thus,  all  the  propeny  values 
associated  with  an  IDEFl  information  object  are  essential  to  that  object;  altering  a  value 
results  in  a  new  object. 

This  view  has  two  consequences  relevant  to  ontology.  First,  there  will  generally  not  be  a 
one-to-one  correspondence  between  the  information  objects  within  an  IDEFl  model  and  the 
real-world  objects  being  modeled.  For  instance,  within  an  IDEFl  model  of  a  certain 
business  there  might  be  an  entity  class,  MANAGER,  and  another  entity  class, 
EMPLOYEE.  These  will  be  different  entity  classes  since  they  keep  different  kinds  of 
information.  Thus,  an  employee  who  is  also  a  manager  would  generate  two  distinct 
information  objects  (one  for  each  class):  one  for  the  employee  in  his/her  role  as  an 
employee,  and  another  for  that  same  real-world  employee  in  his/her  role  as  a  manager. 
Thus,  it  would  be  confusing  to  think  of  IDEFl  as  an  ontology  modeler;  it  is  simply  not 
designed  to  represent  that  kind  of  information.  Second,  since  all  the  properties  of  an 
information  object  are  essential  to  it,  there  is  no  room  for  the  distinction  between  essential 
and  accidental  properties. 

To  press  the  issue  further,  suppose  we  overlook  the  above  problems  and  use  IDEFl  as  an 
ontology  modeling  tool  to  represent  kinds  as  entity  classes.  Here  then  is  another  difficulty. 
Suppose  some  members  of  a  certain  kind  of  engine  widget  have  an  additional,  removable 
part-a  FRAMMITZ-that,  depending  on  its  location  on  the  widget,  makes  them  suitable  or 
not  for  use  in  engines  of  various  sizes.  Then,  having  a  frammitz  and  its  location  on  a  given 
widget  are  accidental  properties  associated  with  the  kind  WIDGET;  members  can  either 
have  them  or  lack  them.  Members  that  have  them  can  come  to  lack  them  but,  nonetheless, 
they  are  important  properties  to  be  aware  of  and  track.  The  inapplicability  of  accidental 
properties  in  IDEFl  has  already  been  noted.  However,  in  IDEFl,  the  No  Null  Rule  states 
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that  every  attribute  associated  with  a  given  entity  class  must  yteld  a  coirespondtng  value  for 
every  member  of  the  entity  class.  Thus,  returning  to  the  example,  location  of  Jrammiiz 
cannot  be  a  legitimate  attribute  in  an  IDEhT  representation  of  the  kind  WIDGET,  since  not 
every  widget  has  a  frammitz  {i.c.,  the  value  of  location  of  Jrammiiz  for  some  widgets  is 
null). 

In  IDEFl,  one  can  capture  the  information  in  question  without  violating  the  No  Null  Rule 
by  inventing  a  new  class  of  entity-- WIDGET_W!TH_FRAMM1TZ.  However,  there  are 
several  problems  with  this  in  the  context  of  ontology.  First,  a  matter  of  ontological 
aesthetics  and  to  paraphrase  Ockham’s  Razor,  one  should  not  be  forced  to  multiply  entity 
classes  beyond  necessity  nor  to  represent  the  information  in  question  by  introducing  an 
entirely  new  entity  class.  However,  and  more  importantly,  despite  the  significant  degree  of 
freedom  allowed  in  constructing  an  ontology  for  a  human-designed  system,  one  is  still 
constrained  to  make  natural  and  u.seful  divisions  into  kinds.  But  a  class  like 
WIDGET._WITH_FRAMMITZ  does  not  represent  such  a  division.  From  the  perspective 
of  ontology,  it  is  an  artifact  foisted  upon  the  modeler  by  the  given  modeling  tool.  The 
information  in  question  is  more  accurately  and  appropriately  captured  by  identifying  the 
class  of  frammitz-bearing  widgets  as  a  mere  subclass  of  widgets  with  members  which 
belong  to  the  class  contingently  than  by  identifying  a  separate,  overlapping  kind. 

One  might  suppose,  then,  that  we  will  fare  better  with  IDEFIX.  Although  there  is  some 
disagreement  about  the  exact  semantics  of  IDEFIX  diagrams,  it  is  clear  that  the  members  of 
an  IDEFIX  “entity”  (the  spectacularly  ill-advised  IDEFIX  term  for  a  class  of  similar 
objects  in  a  system)  are  to  be  considered  as  real-world  objects,  not  information  images  of 
those  objects  (as  in  IDEFl).  Thus,  an  IDEFIX  model  in  the  EMPLOYEE/MANAGER 
example  would  be  thought  to  contain  the  same  real-world  object  in  both  the  EMPLOYEE 
and  MANAGER  entities.  In  an  ontology  model  of  the  same  example,  the  kind 
EMPLOYEE  and  subkind  MANAGER  would  be  considered  in  the  same  way. 
Furthermore,  with  its  capacity  for  expressing  the  subclass  relation,  the  recommended 
analysis  of  the  WIDGET  example  in  the  previous  paragraph  could  be  expressed  in 
IDEFIX.  Perhaps  IDEFIX  is  the  only  model  needed. 

However,  there  are  deeper  limitations.  Chief  among  these  is  that  IDEFl  and  IDEFIX  are 
purposely  designed  with  certain  expressive  limitations  which  constrain  the  structure  of  the 
information  they  represent.  This  makes  for  very  clear,  uncluttered,  and  efficient 


information  and  data  rncxleis.  However,  it  also  limits  the  applicability  of  lDEl-1  and 
IDEFIX  outside  their  intended  domains.  The  inability  of  IDEFl  to  distinguish  essential 
from  accidental  properties  was  illustrated  above.  The  same  problem  is  shared  by  IDEFIX, 
as  illustrated  in  the  manufacturing  cell  example  above.  Suppose,  for  security  reasons,  we 
want  to  make  it  impossible  to  swap  out  the  diamond-cutter  tool  in  the  cutter.  That  is. 
suppose  we  want  to  specify  in  the  list  of  defining  propenies  of  the  kind  Cutler  that  any 
instance  must  have  a  diamond  tool  essentially.  Without  the  capacity  to  express  rntxlal 
information,  this  is  not  possible;  in  particular,  it  is  not  possible  to  express  this  in  IDEFIX. 
Nonetheless,  as  the  example  illustrates,  it  may  be  of  singular  importance  to  be  able  to 
express  such  information. 

Funher  examples  abound.  For  instance,  in  both  IDEFl  and  IDEFIX  it  is  not  possible  to 
name  individual  objects  in  an  ontology  and  assert  things  specifically  about  them.  Rather, 
one  can  only  say  things  that  hold  of  every  member  of  a  given  class  of  entities  in  general. 
This  is  a  crucial  limitation  in  cases  where  there  is  a  distinguished  member  of  a  given  kind 
with  special  properties.  Also,  more  germane  to  the  current  context,  it  effectively  rules  out 
the  possibility  of  carrying  out  the  third  task  of  ontology.  If  one  cannot  say  anything  about 
specific  objects,  he/she  cannot  discuss  the  propenies  they  have.  Again,  the  two  methods 
can  express  only  a  limited  variety  of  general  propositions  about  the  structure  of  the  entities 
within  a  given  class.  For  instance,  one  might  want  to  note  that  for  every  member  of  class 
A  with  property  P,  there  is  another  member  with  property  Q.  This  is  a  straightforward 
quantificational  statement,  easily  expressed,  for  example,  in  predicate  logic.  Once  again, 
this  proposition  is  beyond  the  expressive  capabilities  of  IDEFl  and  IDEFl  X.  However,  as 
shown  in  the  previous  examples,  this  might  well  need  to  be  expressed  to  provide  a 
thorough  characterization  of  the  objects  in  a  system. 

The  overarching  point  is  that  the  existing  IDEF  methods  were  simply  not  designed  for 
ontology  modeling;  they  were  designed  to  meet  other  goals.  Again,  the  claim  is  not  that 
there  is  something  wrong  with  or  inadequate  about  the  existing  IDEFs.  They  were  simply 
not  designed  to  be  tools  for  ontology  modeling  and  should  not  be  expected  to  meet  the 
requirements  of  such  a  tool. 


Increasing  Expressive  Power 


First  and  foremost  among  the  requirements  of  an  ontology  uxjl  is  greater  expressive  power. 
This  need  will  be  met  in  the  theoretical  foundations  of  IDEF5  by  imbuing  its  underlying 
formal  knowledge  representation  language  with  the  full  power  of  first-order  modal  logic. 
The  power  of  first-order  logic  is  well-known  and  greatly  exceeds  the  expressive  power  of 
IDEFI.5  Modal  logic  extends  first-order  logic  by  introducing  modal  operators  for 
necessity  and  possibility  and  a  corresponding  set  theoretic  semantics.  This  extension, 
among  other  things,  gives  one  the  power  to  express  facts  about  essential  and  accidental 
properties  in  a  very  natural  way.  Recall  that  an  essential  property  of  .i  is  one  which  x  must 
have  to  exist. 

The  standard  set-theoretic  .semantics  for  modal  logic  is  discussed  in  terms  of  the  heuristic 
concept  of  a  “possible  world.”  The  idea  goes  back  to  the  philosopher/mathematician 
Leibniz.  Most  of  us  believe  that  there  are  many  ways  the  world  could  be  other  than  the 
way  it  is.  These  ways  the  world  could  be  can  be  thought  of  as  other  possible  worlds.  One 
way  the  world  could  be,  of  course,  is  the  way  the  world  actually  is.  Thus,  the  actual  world 
is  one  of  the  possible  worlds.  Unlike  possible  worlds,  though,  it  is  actual,  not  merely 
possible.  An  object  S  is  said  to  exist  in  a  possible  world,  W,  just  in  case  S  would  have 
existed  if  W  had  been  actual.  An  essential  property  of  an  object  S  is  a  propeny  that  S  could 
not  have  lacked.  In  the  possible  worlds  picture,  this  can  be  defined  as;  property  p  is 
essential  to  S  just  in  case  S  has  p  in  every  possible  world  in  which  S  exists. 
Correspondingly,  p  is  accidental  to  S  if  there  is  some  world  in  which  S  exists  and  fails  to 
have  p. 

It  is  often  illuminating  to  think  of  systems  in  terms  of  possible  worlds.  In  importing  the 
enterprise  of  ontology  into  the  information  modeling  domain,  we  noted  that  our  concern 
was  not  with  the  world  per  se,  but  rather  with  the  world  of  an  organized  system. 
Accordingly,  in  this  context,  possible  worlds  should  not  be  considered  alternative  states  of 
the  world  but  as  alternative  states  of  the  system.  Thus,  a  relational  database  model  could  be 
thought  of  as  modeling,  in  one  fell  swoop,  all  the  possible  states  of  the  database  being 
modeled  (i.e.,  all  the  different  possible  relations  that  could  populate  the  database). 


5  Noninodal  first-order  logic  is  developed  and  di.scus.sed  in  some  detail  in  'Theoretical  Foundations  for 
Information  Representation  and  Constraint  Spccificauon,"  Armsuong  Laboratory  Repon  AL-TP-1991- 
0044,  October  1991. 
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Thinking  in  these  temms  often  helps  one  to  design  more  breadth  and  flexibility  into  the 
model  in  anticipation  of  possible  but  unlikely  or  previously  unconsidered  states.  In 
addition  to  providing  a  definition  of  essential  and  accidental  propenies,  the  possible  worlds 
picture  helps  anticipate  or  consider  all  possible  natural  kinds  that  might  appear  within  the 
system;  thus,  it  defines  a  sufficiently  broad  ontology. 

A  caveat  is  necessary  here  to  head  off  a  potential  misconception.  The  intuitive  concept  of  a 
possible  world  might  suggest  the  idea  of  completeness  or  totality:  a  world,  after  all.  is  a 
total  system,  complete  in  every  detail.  However,  the  use  of  worlds  in  our  fomial  apparatus 
might  suggest  that,  for  us  to  have  an  acceptable  model  of  a  given  system,  we  must  capture 
every  piece  of  information  within  the  system  down  to  its  last  detail.  But  then 
informationally  incomplete  models  like  the  simple  manufacturing  ontology  model,  M, 
above  will  not  do;  we  will  have  to  fill  in  all  the  informational  details  before  we  have  an 
acceptable  model.  For  example,  in  a  system  represented  by  M.  each  machine  consists  of 
parts  not  mentioned  explicitly  in  the  model.  Also,  each  part  meets  certain  specifications  that 
were  not  mentioned,  has  a  certain  origin  (e.g.,  a  particular  vendor)  that  was  not  mentioned, 
and  so  on.  Practically  speaking,  this  descending  chain  of  information  is  unending. 
Similarly,  any  two  objects  within  the  system  can  in  principle  be  regarded  as  a  funher 
object.  There  is  often  a  call  for  such  representations.  For  example,  in  a  system,  M,  the 
cutter  and  drill  may  be  integrated  in  such  a  way  that  it  is  useful  to  regard  them  jointly  as  a 
single  object;  yet,  no  such  object  is  represented  in  M.  Hence,  the  notion  of  a  world  seems 
to  put  far  too  great  a  demand  on  the  nxxleling  enterprise. 

Fortunately,  this  is  not  a  genuine  problem.  The  notion  of  a  world  should  not  be  taken  too 
literally.  Formally  speaking,  worlds  are  just  indexed  structures  that  (in  a  modeling  context) 
represent  possible  or  successive  states  of  a  system.  These  structures  themselves  can  be  as 
sparsely  or  as  richly  detailed  as  the  modeler  desires,  depending  on  how  much  detail  he  or 
she  wishes  to  capture.  In  particular,  a  formalized  version  of  the  model  M,  with  just  that 
much  detail,  would  be  a  fully  acceptable  "world."  Since  there  is  no  finite  upper  bound  to 
the  amount  of  detail  that  can  be  stored  within  this  framework,  one  can  add  detail  or  new 
objects  in  whatever  fashion  is  deemed  appropriate. 

The  efficacy  of  the  framework  of  possible  worlds  is  witnessed  by  the  fact  that  it  is  more  or 
less  the  framework  chosen  by  the  members  of  the  International  Organization  for 
Standardization  (ISO)  working  group  for  characterizing  the  notion  of  a  conceptual  schema. 
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A  conceptual  schema  consists  of  all  the  necessary  propositions  that  hold  in  a  given  system, 
all  possible  worlds,  or  ail  possible  states  of  the  system.  Thus,  our  use  of  the  framework 
ties  in  naturally  with  our  work  on  the  development  of  the  three-schema  architecture. 
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Part  11:  Methodological  Foundations 


Our  methodological  experience  in  ontology  development  is  based  on  practical  industrial 
applications  with  Chrysler,  Sematech,  and  our  work  on  the  emerging  Air  Force  IDEF5 
ontology  description  capture  method.  IDEF5  encapsulates  the  best  practice  experience  in 
ontology  development  of  the  information  management  community  at  large  to  date.  The 
worik  with  Sematech  took  place  in  the  manufacturing  and  engineering  domain;  the  work 
with  Chrysler  occurred  in  the  product  design  domain.  The  experiences  at  both  companies 
in  developing  ontologies  was  found  to  be  remarkably  similar.  The  still-formative 
methodology  sketched  below  is  based  on  this  experience  Broadly  stated,  the  procedure 
consists  of  the  following  five  steps  (brief  annotations  follow  the  statement  of  each  step). 

Step  1  -  Scope  domain  and  collect  raw  data.  This  task  is  responsible  for:  1 )  determining 
the  boundaries  of  a  domain,  2)  performing  interviews  with  the  domain  experts,  and  3) 
collecting  samples  of  data  representative  of  the  inputs,  controls,  policies,  knowledge,  and 
products  of  the  domain. 

Step  2  -  Develop  initial  proto-kinds.  This  task  is  responsible  for  the  analysis  of  raw  data  to 
generate  a  tentative  relation-poor  ontology  of  prom-kinds,  proto-situations,  and  proto¬ 
situation  types.  By  a  relation-poor  ontology,  we  mean  that  system-essential  relations  of 
kinds  are  not  yet  considered  in  detail  at  this  point  (see  the  annotation  to  Step  4  below).  By 
a  prom-kind  (-situation,  -situation  type)  we  mean  a  tentative  kind  (situation,  situation  type) 
generated  from  observation  and/or  a  cursory  analysis  of  existing  sources  of  information. 
This  provides  a  very  useful,  albeit  defeasible,  rough  draft  ontology  to  guide  further  inquiry 
and  analysis. 

Step  3  -  Refine  initial  analysis.  This  task  is  responsible  for  validating  initial  proto-kinds 
and  generating  a  more  stable  (but  still  relation-poor)  ontology  from  the  tentative  ontology. 
Further  inquiry  and  analysis  guided  by  the  tentative  ontology  gradually  yields  a  revised  and 
more  stable  ontology.  Stability  is  of  course  a  relative  notion.  Our  experience  confirms, 
however,  that  careful  analysis  can  come  close  to  the  ideal. 

Step  4  -  Add  relations.  This  task  is  focused  explicitly  on  adding  system-essential  relations 
to  the  ontology.  The  chief  reason  for  focusing  on  system-essential  relations  is  to  prevent 
an  unwarranted  explosion  of  complexity.  If  a  significant  number  of  relations  are 
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introduced  into  the  tentative  ontology,  it  can  become  extremely  messy  to  untangle, 
reassess,  and  refine  the  initial  relational  connections.  Furthermore,  adding  relations  early 
can  be  misleading,  since  the  ostensible  occurrence  of  a  relation  involving  a  nongenuine 
kind  can  prejudice  a  modeler’s  assessment  of  the  reality  of  that  kind. 

Step  5  -  Validate  stable  ontology  using  raw  data.  This  task  is  responsible  for  validating  the 
stable  ontology  by  taking  the  initial  raw  data  and  attempting  to  instantiate  it,  i.e.,  model  it 
within  the  stable  ontology.  Where  this  does  not  prove  possible,  or  where  it  proves 
inordinately  awkward,  the  ontology  is  modified  appropriately. 

At  each  step  in  the  process,  the  results  will  be  distributed  for  peer  review  and  comment.  In 
our  experience.  Steps  1,  3,  and  4  were  found  to  work  very  well  in  team  contexts.  Step  2, 
the  move  from  a  tentative  ontology  to  a  stable  one,  appears  to  be  most  effectively  done  by 
an  individual.  As  opposed  to  team-oriented  steps,  fairly  refined  attunement  to  certain 
patterns  within  the  system  seem  to  appear  when  an  individual  works  alone  to  develop  the 
tentative  ontology  from  the  raw  data. 
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Part  III:  The  IDEF5  Description  Development  Environment 


Levels  of  Data  Entry 

First-order  logic  is  powerful  and  efficient;  however,  a  good  bit  of  exjxirience  is  required  to 
master  the  art  of  translating  ordinary  language  into  it.  Thus,  we  envision  an  environment 
that  will  permit  several  levels  of  data  entry.  Those  familiar  with  logic  should  be  able  to 
enter  information  in  that  format  directly.  A  level  up  from  direct  entry  will  be  the  possibility 
of  graphical  entry.  Several  graphical  representations  of  first-order  logic  have  been 
developed;  many  explicitly  for  the  end-of-knowledge  representation.*  We  will  be  drawing 
on  this  work  to  develop  a  graphical  representation  of  first-order  modal  logic.  The  modal 
component  in  particular  will  require  work  beyond  what  is  currently  available.^ 

In  conjunction  with  the  graphical  language,  we  will  incorporate  the  capability  for  guided, 
structured  text  entry.  The  form  of  such  entries  will  be  midway  between  straight  first-order 
modal  logic  and  unconstrained  natural  language.  We  are  fully  cognizant  of  the  severe, 
perhaps  intractable,  difficulties  of  full  natural  language  processing  (NLP)  and  do  not  expect 
that  we  will  be  able  to  develop  a  full-blown  NLP  component  for  the  IDEF5  environment 
(though  it  will  certainly  be  capable  of  incorporating  the  current  state-of-the-art).  However, 
our  experience  and  that  of  others  in  developing  constrained  natural  language  environments 
has  shown  that  users,  with  relative  ease,  learn  to  express  their  thoughts  within  certain 
syntactic  guidelines.®  Developing  such  guidelines  in  the  IDEF5  environment  will  then 
permit  entry  of  data  in  a  manner  that  is  relatively  natural  and  easy  to  learn,  but  which  is 
either  immediately  processable  by  the  software  or  easily  converted  into  processable  form. 
The  facility  will  include  on-line  guidance  for  proper  entry  and  an  appropriate  amount  of 
built-in  syntax  checking  to  assist  the  user  without  confusing  or  defeating  him  or  her. 


*  See  [Sowa  84]  and  [Burch  91  ]  for  two  representative  examples. 

^  One  of  the  research  team's  stronger  areas  of  expertise  is  in  the  area  of  modal  logic.  Cf.  C.  Menzcl,  “The 
True  Modal  Logic,"  forthcoming  in  the  Journal  of  Philosophical  Logic',  also  C.  Menzel,  “Actualism, 
Ontological  Commitment,  and  Possible  World  Semantics,"  forthcoming  in  Synthese. 

®Cf.  P.  Mayer,  “A  Computational  Approach  for  Processing  Locative  and  Temporal  Information  in  Clinical 
Medical  Records,"  unpublished  Ph.D.  dissertation.  Department  of  Computer  Science,  Texas  A&M 
University,  1989;  also  P.  Mayer  el  al.,  “Locative  Inferences  in  Medical  Texts,”  Journal  of  Medical  Systems 
11,68-85,(1987). 
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Finally,  the  IDEF5  environment  will  also  allow  straight  text  entry  for  those  unfamiliar  with 
the  graphical  or  first-order  languages,  and  quick  collection  of  domain  knowledge  that  can 
be  analyzed  more  formally  at  a  later  time. 

Hooks  to  Other  Methodologies 

Our  chief  goal  in  developing  and  extending  the  suite  of  methods  is  data  integration.  Thus, 
we  envision  the  IDEF5  environment  itself  to  be  smoothly  integrated  with  the  other  IDEF 
method  support  tools,  as  well  as  with  tools  developed  for  other,  related  methods  such  as 
ER^  and  NIAM.^®  Our  efforts  are  thus  geared  toward  the  development  of  a  comprehensive 
information  modeling/knowledge  engineering  environment  capable  of  storing,  integrating, 
and  reasoning  with  information  across  various  types  of  domains. 


Entity-Relationship  (ER)  modeling  method. 
^^Nijssen  Information  Analysis  Method  (NIAM). 


I 
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Part  IV:  Formal  Foundations 


In  this  section  we  provide  a  formal  language  and  model  theory  for  ontology,  and  indicate 
the  roles  of  the  various  elements  of  the  fomialization  {i.e.,  to  which  aspects  in  the  above 
informal  development  they  correspond). 

Model  Theory 

We  begin  with  the  notion  of  a  basic  ontology  model  structure  (boms),  which  is  a 
representation  of  a  system  ontology  (at  some  level  of  development  and  detail).  More 
precisely,  a  boms,  M,  is  an  8-tuple  •D,W,(g),d,£,K,R,p6  in  which  D  and  W  are  mutually 
disjoint  nonempty  sets,  @(EW,  d  ;  W  /E  Pow(D)  (i.e.,  d  is  a  function  from  W  into  the 
power  set— set  of  all  subsets— of  D).  Intuitively,  D  is  the  set  of  all  possible  individuals;  W 
is  the  set  of  all  possible  worlds  or,  more  relevantly,  all  possible  states  of  a  given  system; 
and  @  is  the  actual  world,  or  actual  system  state.  Then,  d  must  be  considered  a  function 
that  assigns  to  every  possible  world,  wOEW,  the  subset  of  D  that  consists  of  the  possible 
individuals  that  exist  in  w.  Consequently,  d(w)  is  called  the  domain  of  w. 

The  last  four  elements  of  M  need  a  little  more  discussion.  First,  for  all  natural  numbers,  /i, 
let  Frt  be  {f  1  f :  W  yE  Pow(D^)};  i.e.,  the  set  of  all  functions  from  W  into  the  set  of  all  sets 
of  n-tuples  of  elements  of  D.  F^  is  the  standard  possible  world  semantical  definition  of  the 
set  of  all  n-place  relations;  in  particular,  Fj  is  the  definition  of  the  set  of  all  properties.  The 
idea  behind  this  definition  is  that,  whatever  properties  ultimately  exist,  it  is  intuitively  clear 
that  corresponding  to  each  property  in  any  given  world  is  the  set  of  all  the  things  in  that 
world  which  have  the  property.  For  e.xample,  corresponding  to  the  property  redness  in 
the  actual  world  is  the  set  of  all  things  that  are  actually  red;  in  another  world,  there  is  a 
different  set.  This  suggests  that,  rather  than  seeking  any  deeper  analysis,  we  simply 
identify  redness  with  these  varying  sets,  or  more  precisely,  that  we  identify  it  with  a 
function  that,  in  each  world  w,  picks  out  exactly  the  red  things  in  w.  To  have  the  property 
redness  in  a  given  world  w  is  thus  simply  to  be  in  the  set  of  things  (the  red  things,  of 
course)  picked  out  by  the  property  in  w,^  ^ 

^'Xhis  is  the  standard  "possible  worlds"  definition  of  properiie.s  and  relations.  The  account  has  suffered 
much  criticism  from  philosophers  and  linguists  of  late  because  it  is  coarse-grained,  i.e.,  properties  and 
relations  that  pick  out  the  same  sets  in  all  possible  worlds  are  identical.  However,  intuitively,  the 
objection  goes,  properties  and  relations  can  be  necessarily  coextensive  without  being  identical,  e.g.,  the 
properties  triangularity  and  trilaterality.  Though  iTiportant,  it  is  our  belief  that  these  issues  do  not 
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In  this  account  of  pro^Kriics  and  relations,  the  function  d  nt  M  which  assigns  a  donum  of 
objects  to  each  world,  w(l:W,  is  a  pr«.>pcriy,  natneh  the  proptnv  etislence  It  is  a 
function  which  assigns  to  each  world,  w  ,  the  set  of  objects  that  cssst  in  w  llic  dssunciion 
between  essential  and  accidental  projserties  is  captured  straightlorwaidl)  in  this  frainework 
As  noted  above,  intuitively,  an  object  has  a  pro^icny.  p.  cssetuiaiiy  tjust  in  case  n  has  u  in 
every  possible  world  sn  which  it  csists)  and  p  accidentally  ijusi  in  case  there  is  some  world 
in  which  it  exists  but  lacks  pi.  This  iransiaics  as  follows  an  object  a  has  the  property 
pCEFj  essentially  For  all  w(TW  such  that  cKTwUw  )  is  c  .  for  all  worlds  in  which  a  exists  ). 
aCEp(w));  and  for  relations  generally,  objects  uj,  stand  in  the  relation  r(l-.{>j 

essentially  for  all  w  such  that  aj,  ..  u^d.dtw).  uj,  ,  ,  ciflfKT.n  w  )  Sinnlarh  ,  a  has  tfic  p 
accidentally  m  case  w(TW  such  that  sKTdtw)  but  uurptw  > 

Given  the  definition  of  f-,.  we  can  sjsccify  the  ch,iracier  of  the  rcnsaining  elements  of  M 
First,  we  stipulate  that  fTF.Fj  i  c..  that  ^  is  a  two  place  relation  on  jKissiblc  individuals- 
and  that  for  each  w(FW.  £(w  i  is  a  reflexive  par luil  orderini;  on  the  domain  dtw  i  of  w  'Iftat 
is,  writing  u£vw/>  for  •ti.bCXIii't  w  j.  for  all  ,>i*a  ireflcxivity).  and  for  all 

a,^,c(Ed{w).  if  arid  ’^c  :  (transiiiviryi  Intuiiivciy.  £  represents  the  pari- 

whole  relation;  thus,  for  ail  w(F.W.  £(w  )  is  the  set  of  pairs  a,M)  (F,  dtw  i  such  that  a  is  a 
part  of  h  in  the  world  or  ,systcm  >uUv  •»  T’.us  ti^h  can  be  read  as  a  is  a  part  of  h  in  w 
We  wnie  a<^h  if  aLy,h  and  aiih.  and  say  that  a  is  a  proper  part  of  h  m  w  if  a<^h.  We  also 
say  that  a  is  simple  in.  or  relative  to.  w  if  a  has  no  proper  parts  in  w  If  a  is  not  simple  in 
w,  then  we  say  that  a  ts  complex,  or  a  system.  :n  w  .’- 

As  stressed  above,  part-whole  relations  are  crucial  for  the  accurate  representation  of 
physical  systems,  especially  manufactunng  and  engineering  systems,  and  this  additional 
structure  imposed  on  the  objects  of  each  possible  world  (possible  system  state)  captures 
those  relations  in  a  simple  but  powerful  way.  Note  that  since  the  relation  is  partial,  it  can 


typically  affect  ontology  or  informat.on  mtxkling  generally,  and  hence  the  complcsiiics  of  fincr-gramcd 
accounts  can  be  avoided.  Cf.  e  g..  J  Barwisc  and  J  Perry  .  Situunnns  and  Aintu  ks  (Cambridge.  MIT 
Press/Bradford  Books,  198.1).  ch.  2;  G.  Bcalcr.  Quahn  and  Cnmept  (Oxford.  Oxiord  Univcrsily  Press, 
!980).  ch.  2. 

12Tbc  idea  of  adding  additional  algebraic  suiicturc  on  each  world  s  domain  of  individuals  locaplure  the  part- 
wholc  relation  was  inspired  by  the  work  of  Godchard  l  ink  on  the  semantics  of  plurals.  Link  imposes  a 
full-blown  boolean  algebra  on  the  individuals  to  provide  irtcrprctations  for  a  wide  variety  of  plural 
phenomena  in  natural  language,  and  this  seems  lo  he  far  m  irc  structure  than  is  necessary  for  present 
purposes  here.  Link  also  resuicts  his  attention  to  nonmtxlal  contexts  See  G.  Link,  "The  Logical  Analysis 
of  Plurals  and  Mass  Terms:  A  Lauicc  Theoretic  Approach."  in  R.  Baueric  ct  al.  (cds).  Meaning.  Use.  and 
Interpretation  (Berlin,  DcGruytcr,  1983) 
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be  as  elaborate  or  as  sparse  as  required;  everything  from  the  empty  relation  to  a  linear  well¬ 
ordering  counts  as  a  partial  ordering.  The  requirements  of  reflexivity  and  transitivity 
guarantee  only  that  every  object  is  a  part  of  itself,  and  that  the  pans  of  the  pans  of  an  object 
a  are  also  pans  of  a.  In  particular,  because  models  need  not  be  complete  descriptive 
representations,  the  pan-whole  relations  between  objects  in  a  model  can  be  as  detailed  or  as 
sparse  as  desired.  This  makes  for  great  flexibility  in  the  development  of  models,  since  it 
allows  one  to  add  pan-whole  infomiation  incrementally  in  the  construction  of  a  mode!  to 
whatever  extent  is  deemed  necessary.  Note  also  that  the  pan-whole  relation  needn't  hold 
essentially  between  two  objects.  That  is,  it  is  perfectly  consistent  within  a  model  for  a  to 
be  a  part  of  b  in  one  world  w  and  for  a  not  to  be  a  part  of  b  in  another.  This  implements 
the  idea  discussed  above  (see  footnote  2)  that,  intuitively,  most  complex  objects  do  not 
have  all  of  their  pans  essentially. 

The  sixth  element,  K  ^  Fi_  is  a  set  of  properties  that  represent  the  kinds  within  a  system; 
hence,  the  members  of  K  are  called  the  M-kinds  or  kinds  of  M.  In  our  informal 
development  above,  kinds  were  identified  with  classes,  which  are  usually  considered 
collections  of  some  kind.  However,  kinds  cannot  be  thought  of  as  mere  collections 
because  they  transcend  their  members;  the  nature  of  a  kind  is  not  altered  if  its  instances 
change.  This  is  precisely  the  feature  of  properties  noted  above  that  distinguishes  them  from 
sets.  Thus,  in  our  more  precise  development,  kinds  are  best  identified  with  certain 
distinguished  properties,  and  hence  K  is  stipulated  to  be  a  subset  of  the  set  of  properties 
Fi. 

The  seventh  element,  R,  of  M  represents  the  system-essential  relations;  hence,  we  stipulate 
that  R  O  »„>  2F;,,  i.e.,  that  R  is  a  subset  of  the  set  of  all  2-or-more-place  relations.  The 
final  element,  p,  is  defined  as  a  function  of  K»R  such  that  p  :  K  /E  Pow(Fi-K),  and  p  :  R 
/E  »n>2^n  such  that  for  all  «-place  relations  r(ER  (n>2),  p(r)(EK„.  Therefore,  the  role  of 
p,  intuitively,  is  to  map  each  kind,  k,  to  the  set  of  its  defining  properties  of  k,  and  to  map 
each  relation,  r,  in  R  to  the  kinds  relative  to  which  r  is  system-essential  in  w.  That  no  kind 
is  the  defining  property  of  some  other  kind  (or  itself,  for  that  matter)  is  ensured  by  the 
stipulation  that  p  maps  K  into  Pow(Fi-K),  rather  than  PowCFj)  simpliciter.  The  stipulation 
that  p  be  one-to-one  assures  that  no  two  distinct  kinds  have  precisely  the  same  defining 


*^Note,  however,  that  having  a  filter  might  be  a  defining  property  of  the  kind  cleaner,  so  that  in  any  .state 
w  of  a  giver,  system,  all  cleaners  must  have  filters.  We  have  dwelled  on  this  point  already  above,  but  if  an 
ontology  i.s  not  to  be  muddled,  it  is  crucial  Uiat  the  distinction  be  clear. 
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properties.  Note  that  the  defining  properties  of  one  kind,  k,  might  constitute  a  (proper) 
subset  of  the  defining  properties  of  another  kind,  k0,  so  that  every  instance  of  k0  is  an 
instance  of  k.  In  such  a  case,  we  say  that  k0  is  a  subkind  of  k.  For  example,  one  might 
wish  to  define  a  general  kind,  cutter,  and  two  separate  subkinds,  diamond-tool  cutter 
and  carbide-tool  cutter,  which  were  obtained  by  adding  additional  propenies  to  the 
more  inclusive  kind.  By  defining  the  defining  properties  of  a  kind  independent  of  any 
world,  we  build  in  the  idea  that  they  are  essential  to  it.  One’s  conception  of  a  panicular 
kind  might  change  over  time,  of  course,  but  this  can  be  represented  in  terms  of  a  series  of 
several  distinct  but  related  kinds. 

Further  stipulations  about  the  behavior  of  p  must  be  made  in  order  to  assure  that  defining 
properties  and  system-essential  relations  are  represented  correctly  in  M.  Specifically,  we 
add  two  conditions  on  p.  First,  if  kOEK,  then  for  all  pCEp(k),  k(w)  ^  p(w),  for  all  wCEW, 
i.e.,  in  any  world  w,  every  member  of  the  kind  k  in  w  must  have  the  property  p.  Second, 
in  the  same  manner,  for  any  n-place  relation  rfER  such  that  p(r)  =  -ki, ...,  k„0,  for  any 
wCEW,  if  kj(w)jtA  for  all  i  such  that  l£i£n,  then  there  are  a\, ...,  (3;i(Ed(w),  a,CEk/(w), 
l£j£rt,  such  that  a], a;,6(Er(w).  This  condition  captures  the  system-essentiality  of 
system-essential  relations.  Specifically,  the  condition  states  that  for  any  world,  w, 
whenever  each  of  tne  kinds  relative  to  which  r  is  system-essential  has  at  least  one  member 
in  the  domain  of  w,  then  r  holds  between  members  of  those  kinds  in  w. 

An  important  relation  that  can  occur  between  models  is  that  one  can  be  embedded  in  another 
in  the  sense  that  all  the  information  in  one  model  is  preserved  in  another  model  which 
contains  more  information.  If  a  model,  M,  is  so  embedded  in  another  ,M0,  we  say  that  M 
is  a  submodel  of  M0.  This  sort  of  situation  can  arise  in  at  least  two  ways.  First,  it  is  an 
essential  fact  of  the  modeling  enterprise  that  models  evolve  over  time.  One  of  the 
circumstances  under  which  this  happens  is  when  an  existing  model  must  be  augmented  in 
ht  of  new  information.  Another  is  when  one  might  purposely  filter  information  to  obtain 
a  simpler,  more  coarsely  grained  model— after  all,  not  all  available  information  is  useful  in 
all  contexts;  thus,  one  might  freely  filter  the  information  in  a  given  comprehensive  model  in 
a  variety  of  ways  to  obtain  many  different  submodels. 

To  begin,  assume  that  M  =  •D,W,@,d,£,K,R,p6  is  a  substructure  of  M0  = 
•D0,W0,@0,d0,£0,K0,R0,p06  if  and  only  if  D  ^  D0,  W  ^  W0,  @  =  (3)0,  and  d(w)  = 
d0(w)«D,  for  all  wCEW.  Suppose  M  is  a  substructure  of  M0,  let  r(E»nF;,  be  an  n-place 

1 

i 
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relation  of  M,  and  let  r<S  be  an  «-place  relation  of  Mtt.  Then  we  say  that  r  is  the  restriction 
of  r0  to  M,  written  r0?tM,  just  in  case,  for  all  w(EW,  r(w)  =  r0(w)«D.  M  is  a  submodel  of 
M0  just  in  case  M  is  a  substructure  of  M0;  £  =  £0?tM;  for  each  kCEK  there  is  a  k0CEK0 
such  that  k  =  k0^tM  (such  a  k0  is  called  a  correlate  of  k  in  M  0)  and  similarly,  for  R  and 
R0;  for  each  kCEK,  and  for  each  pCEp(k),  there  is  some  p0(Ep0(k0)  such  that  p  =  p0?tM, 
where  k0  is  a  correlate  of  k  in  M0;  and  for  each  n-place  i<ER,  p(r)  =  k^l ,  k0,6,  where 

l£/£n,  is  a  correlate  of  k  in  M0. 

That  is,  M  is  a  substructure  of  M0  if  the  individuals  and  worlds  of  M0  include  those  of  M 
they  share  the  same  actual  world,  and  the  individuals  that  exist  in  a  world  of  M  are  exactly 
those  individuals  of  M  that  inhabit  that  world  in  M0.  Thus,  all  the  individuals  that  inhabit 
that  world  according  to  M  also  inhabit  it  according  to  M0,  though  M0  may  include  new 
individuals  in  that  world  as  well.  The  remaining  conditions  that  must  be  met  in  order  for  M 
to  be  a  full-blown  submodel  of  M0  simply  spell  out  the  idea  that  the  prof>erties  and 
relations  of  M— in  particular,  the  part-whole  relations,  the  kinds,  and  the  system-essential 
relations  of  M— can  only  change  in  M0  in  ways  that  increase  information,  i.e.,  such  that 
none  of  the  information  of  M  is  lost.  For  example,  if  a  is  part  of  b  in  w  relative  to  M,  then 
a  is  a  part  of  6  in  w  relative  to  M0"though  there  may  be  some  part,  c,  of  a  in  w  relative  to 
M0  that  was  not  recognized  in  M  because  c  is  not  among  the  individuals  in  M.  This 
corresponds  to  a  situation  in  which  M0  represents  a  finer-grained  representation  of  a 
system  also  represented  by  M.  Again,  a  kind,  k,  may  have  more  defining  propenies  in  M0 
than  it  had  in  M,  but  those  in  M0  that  have  correlates  in  M  will  still  be  true  of  all  the  objects 
in  M0  that  they  were  true  of  in  M  (plus  perhaps  some  that  were  not  among  the  individuals 
of  M).  It  may  be,  however,  that  certain  defining  properties  of  k  in  M0  were  not  recognized 
in  M  because  they  only  appear  at  a  finer  level  of  granularity,  or  because  of  some  other  shift 
in  perspective  not  captured  by  M. 

Languages  for  Ontology 

In  this  section,  we  present  the  formal  language  for  ontology  and  discuss  the  development 
of  more  user-friendly,  graphical  languages  for  use  in  the  IDEF5  description  development 
environment. 

As  noted,  the  formal  IDEF5  language  L  will  be  a  modal  extension  of  first-order  logic.  It 
will  thus  consist  of: 


24 


a  possibly  infinite  store  of  individual  constants  ci,  C2 . 


•  individual  variables  vi,  V2, 

•  n-place  first-order  predicate  constants  P”,  P^,  and 

•  n-place  predicate  variables  Fp  F",  .... 

for  any  or  all  n  as  desired,  though  it  is  required  that  L  at  least  contain  ail  variables  and  the 
predicates  pf,  which  will  ordinarily  be  written  as  =,  as  well  as  the  predicate  P^,  which  will 

be  written  as  e.  e  will  express  the  part-whole  relation  in  L.  In  addition,  L  will  contain  the 
standard  logical  operators  $  (existential  quantifier),  (negation),  and  &  (conjunction),  as 
well  as  the  modal  operator  $  (possibility).  L  differs  from  typical  first-order  modal 
languages  because  it  contains  predicate  variables  as  well  as  distinguished  higher-order 
predicates  KIND,  DP,  and  SER;,  {n>2)  which  express  the  property  of  being  a  kind,  the 
relation  between  a  kind  and  its  defining  properties,  and  the  relation  between  a  system- 
essential  relation  and  the  kinds  relative  to  which  it  is  such,  respectively. 

The  syntax  of  the  formal  language  will  also  be  standard,  modulo  the  special  higher-order 
predicates. 

•  If  P  is  an  n-place  first-order  predicate  (constant  or  variable)  and  ti,  ...,  any  n 
terms  (i.e.,  constants  or  variables),  then  Pti, ...,  t^  is  a  (first-order  a.tomic)  formula 
(ofL). 

•  If  P  is  a  one-place  first-order  predicate,  then  KIND(P)  is  a  (second-order  atomic) 
formula. 

•  If  P  and  Q  are  one-place  first-order  predicates,  then  DP(P.Q)  is  a  (second-order 
atomic)  formula. 

•  If  P  is  a  two-place  first-order  predicate  and  Pi, ...,  P„  (n>2)  are  one-place  first-order 
predicates,  then  SER„  (P,Pi, P^)  is  a  (second-order  atomic)  formula. 
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If  j  and  y  are  formulas,  so  are  -tf,  t f,  and  (f  &  y). 


•  If  j  is  a  formula  and  c  is  any  variable  (individual  or  predicate),  then  $aj  are  formulas. 
We  define  the  other  standard  logical  operators  in  the  usual  way: 

•  f/y  =df  &  -’y).  f  ^  y  =(lf  -'(f  &  -ly),  f  '  y  =cjf  {f  y€  y)  &  (y  /€  f),  and 


•  "  af  =df  -i$a-if,  []f  =df  -it-if. 


Interpretations 

Given  a  model,  M,  and  a  language,  L,  for  ontology,  we  can  now  specify  how  L  is 
interpreted  in  M.  This  is  done  in  terms  of  an  interpretation  function,  V,  which  maps 
elements  of  L  to  appropriate  semantic  objects  of  M.  The  general  notion  of  an  interpretation 
function  is  discussed  at  length  in  [Menzel  91a],  so  we  will  not  dwell  on  details.''*  An 
interpretation  function  V  for  L  and  M  =  •D,W,@,d,£,K,R,p6  is  a  function  such  that: 

•  if  t  is  a  term  (of  L),  then  V(t )  GE  D;  ”, 

•  if  P  is  an  n-place  first-order  predicate  (constant  or  variable)  and  w(EW,  then  V(P,w) 
(E  V(Dn);  in  particular,  V(P^,w)  =  {  a.aO  I  a(Ed(w)},  and  V(P^,w)  =  £(w)  (recall, 
P^  is  to  be  the  identity  predicate  and  P^  the  predicate  that  expresses  the  part-whole 
relation),  and 

•  V(KIND)  (E  V(K);  V(DP)  =  {  k.pO  I  k  (E  K  and  p  (E  p(k));  and  V(SER„)  =  {  r.ki, 
...,  k„6  t  -ki,  ...,  k„6  (E  p(r)}. 

Interpretations  for  formulas  of  L  will  be  defined  recursively  in  terms  of  V  in  the  usual  way. 
Specifically,  we  define  V'  to  be  a  total  extension  of  V  such  that  V'  also  maps  the  formulas 
of  L  into  the  set  {T,F}  (truth  and  falsity)  in  the  following  ways. 


'■*0.  Menzel  and  R.  Mayer,  “Theoretical  Foundations  for  Information  Representation  and  Constraint 
Specification,”  Armstrong  Laboratory  Report  AL-lP-1991-0044,  October  1991. 
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•  If  j  is  a  first-order  atomic  formula  Ptj,  t^.  then  V'(j,w)  =  T  iff  ti,  ....  t„6  CE 
V(P,w). 

•  If  j  is  KIND(P),  then  V^(j,w)  =  T  iff  V(P)  (E  K. 

•  If  j  is  DP(P,Q),  then  V'(j,w)  =  T  iff  V(P,w)  CE  K  and  V(Q,w)  (E  p(V(P,w)). 

•  If  j  is  SERfl  (P,Pi,  P;,).  then  V'(j,w)  =  T  iff  V(P)  CE  R  and  V{Pi,w) . 

V(P„,w)6  (E  p(V(P,w)). 

•  Ifj  is -ly,  then  V'(j,w)  =T  iff  V'(j,w)  =  F. 

•  If  j  is  (y  &  q),  then  V'(j,w)  =  T  iff  both  V'(y,w)  and  V'(q,w)  =  T. 

•  If  j  is  $ay,  then  V'(j,w)  =  T  iff  there  is  a  total  extension  V  "  of  V  differing  from  V'  at 
most  in  what  it  assigns  to  a  such  that  V"(y.w)  =  T. 

•  If  j  is  ty,  then  V'(j,w)  =  T  iff  there  is  a  w'  CE  W  such  that  V'(y,w')  =  T. 

Axioms  for  Ontology 

A  proper  axiomatic  basis  that  captures  the  logic  of  our  ontology  naodels  will  be  needed  as  a 
basis  for  developing  computational  tools  with  a  capacity  for  automated  reasoning.  In  this 
section,  we  will  describe  an  appropriate  axiomatic  basis,  though  we  will  not  explore  the 
issues  of  computational  implementation;  that  will  be  a  task  for  the  next  phase  of  IDEF5 
development. 

The  basis  for  the  system  will  be  a  fairly  weak  second-order  modal  logic.  That  is,  in 
addition  to  the  usual  propositional  tautologies,  and  axioms  for  quantifiers  and  identity,^ ^ 
we  also  have  the  usual  axioms  of  the  modal  logical  system  SS; 

K:  [l(j..y)..([]j..ny). 


^^See  again  [Mcnzel  91a].  It  should  be  noted  that  the  logic  developed  in  [Menzel  91a]  is  not  second-order; 
however,  the  quantifier  axioms  for  the  logic  in  this  report  will  wwk  in  exactly  the  same  way,  regardless  of 
whether  the  quantified  variable  is  first-  or  second-order. 
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T:  j . .  :jj,  and 

5: 

and  the  rule  of  inference  of  necessimtion: 

Nec:  If  j,  then [Ij. 

That  is,  if  a  formula  j  is  provable,  then  the  proposition  [)j  that  it  is  necessary  is  also 
provable.  This  rule  captures  the  intuition  that  anything  provable  is  a  truth  of  logic  and 
hence  should  be  true  in  all  possible  worlds  or  for  all  possible  system  states.’^ 

The  last  thing  we  need  are  axioms  that  capture  the  logical  content  of  the  distinguished 
predicates  of  the  ontology  language--i.e.,  KIND,  DP,  and  the  predicates  SER„--that  was 
given  them  in  the  definition  of  an  interpretation  above.  Thus,  we  have: 

Ol:  DP(F,G)  ..  (KIND(F)  &  ^KIND(G)), 

02:  DP(F,G)  "  x(Fx  ..  Gx), 

03;  SER„  (F,Fi,  ...,  F„)  ..  (KIND(Fi)  &  ...  &  KIND(F„)), 

04:  SER„  (F,Fi,  ...,  F„)  ..  [($xFix  &  ...  &  SxF^x)  ..  ($xiFjx|  &  ...  &  SxF^Xn  - 
Fxi...x„)], 

05:  DP(F,G)  ..  nDP(F,G),  and 

06:  SER„  (F,Fi,  ...,  F„)  ..  []SER„  (F,F,,  ...,  F„). 

01  says  that  if  the  relation  DP  holds  between  two  properties  F  and  G,  F  must  be  a  kind 
and  G  must  not.  This  captures  the  idea  that  DP  holds  between  a  kind  and  any  of  its 
defining  properties,  which  includes  the  idea  that  no  kind  is  a  defining  property  of  any  kind. 

*^Therc  is  some  doubt  about  the  soundness  of  ncct^sitation  as  a  general  modal  rule  of  inference;  however, 
cf.  C.  Menzel,  “The  True  Modal  Logic,”  forthcoming  in  the  Journal  of  Philosophical  Logic. 
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02  states  that  a  member  of  a  kind  must  have  all  the  defining  propenies  of  the  kind;  it  does 
not  say  that  the  member  must  have  them  essentially  (in  line  with  our  earlier  discussion  of 
the  notion  of  a  kind).  03  notes  that  system-essential  relations  are  such  relative  to  some 
finite  collection  of  kinds.  04  states  that  a  system-essential  relation  must  hold  between 
representatives  of  the  kinds  it  is  relative  to  whenever  these  kinds  are  nonempty.  05  and 
06  capture  two  important  modal  properties  of  kinds  and  system-essential  relations;  (1 )  if 
G  is  a  defining  propeny  of  a  kind,  F,  it  is  necessarily  a  defining  propeny  of  F;  and  (2)  if  F 
is  system-essential  relative  to  some  collection  of  kinds,  it  is  necessarily  system-essential 
relative  to  those  kinds.  These  properties  are  enforced  in  the  model  theory  by  the  fact  that 
the  values  of  the  function  p  on  a  member  of  K  or  R  were  defined  independent  of  W. 
(Recall  that  p  determines  the  defining  properties  of  a  given  kind  and  the  kinds  relative  to 
which  a  given  relation  is  system-essential.)  Note  also  that  few  properties  of  kinds  and 
system-essential  relations  follow  from  the  restriction  that  p  be  one-to-one.  These  are  not 
expressible  in  the  language  as  it  stands  becau.se  u  requires  that  we  be  able  to  express 
identity  between  properties  and  relations;  this,  in  turn,  requires  a  second-order  identity 
predicate.  Addition  of  such  a  predicate  will  be  explored  in  the  next  phase  of  IDEF5 
development. 
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